Insertion of characters in speech recognition

ABSTRACT

One embodiment provides a method, including: receiving, from an audio capture device, speech input; converting, using a processor, the speech input to machine text; receiving, from an alternate input source, an input comprising at least one character; identifying, using a processor, a location associated with the machine text to insert the at least one character; and inserting, using a processor, the at least one character at the location identified. Other aspects are described and claimed.

BACKGROUND

The portability of information handling devices (“devices”) has allowedusers to use devices anywhere. Some users prefer to provide input forthese devices, for example, smart phones, tablets, smart watches,personal computer, laptop computers, and the like, using speech to texttechnology or speech recognition software. As speech to text technologyhas become more accurate and easier to use, users are employing thesoftware to provide long bodies of text rather than short messages, forexample, email dictations rather than short text messages. To makeaccurate dictations using the speech to text technology the user mustprovide a speech command in order to input punctuation into the body oftext. For example, the user must say the word “comma” rather than typinga “,” as in traditional input methods.

BRIEF SUMMARY

In summary, one aspect provides a method, comprising: receiving, from anaudio capture device, speech input; converting, using a processor, thespeech input to machine text; receiving, from an alternate input source,an input comprising at least one character; identifying, using aprocessor, a location associated with the machine text to insert the atleast one character; and inserting, using a processor, the at least onecharacter at the location identified.

Another aspect provides an information handling device, comprising: anaudio capture device; a processor operatively coupled to the audiocapture device; a memory device that stores instructions executable bythe processor to: receive, from an audio capture device, speech input;convert, using a processor, the speech input to machine text; receive,from an alternate input source, an input comprising at least onecharacter; identify a location associated with the machine text toinsert the at least one character; and insert the at least one characterat the location identified.

A further aspect provides a product, comprising: a storage device thatstores code executable by a processor, the code comprising: code thatreceives, from an audio capture device, speech input; code that convertsthe speech input to machine text; code that receives, from an alternateinput source, an input comprising at least one character; code thatidentifies, using a processor, a location associated with the machinetext to insert the at least one character; and code that inserts, usinga processor, the at least one character at the location identified.

The foregoing is a summary and thus may contain simplifications,generalizations, and omissions of detail; consequently, those skilled inthe art will appreciate that the summary is illustrative only and is notintended to be in any way limiting.

For a better understanding of the embodiments, together with other andfurther features and advantages thereof, reference is made to thefollowing description, taken in conjunction with the accompanyingdrawings. The scope of the invention will be pointed out in the appendedclaims.

BRIEF DESCRIPTION OF THE SEVERAL VIEWS OF THE DRAWINGS

FIG. 1 illustrates an example of information handling device circuitry.

FIG. 2 illustrates another example of information handling devicecircuitry.

FIG. 3 illustrates an example method of insertion of characters inspeech recognition.

FIG. 4A-FIG. 4B illustrates an example of insertion of characters inspeech recognition.

DETAILED DESCRIPTION

It will be readily understood that the components of the embodiments, asgenerally described and illustrated in the figures herein, may bearranged and designed in a wide variety of different configurations inaddition to the described example embodiments. Thus, the following moredetailed description of the example embodiments, as represented in thefigures, is not intended to limit the scope of the embodiments, asclaimed, but is merely representative of example embodiments.

Reference throughout this specification to “one embodiment” or “anembodiment” (or the like) means that a particular feature, structure, orcharacteristic described in connection with the embodiment is includedin at least one embodiment. Thus, the appearance of the phrases “in oneembodiment” or “in an embodiment” or the like in various placesthroughout this specification are not necessarily all referring to thesame embodiment.

Furthermore, the described features, structures, or characteristics maybe combined in any suitable manner in one or more embodiments. In thefollowing description, numerous specific details are provided to give athorough understanding of embodiments. One skilled in the relevant artwill recognize, however, that the various embodiments can be practicedwithout one or more of the specific details, or with other methods,components, materials, et cetera. In other instances, well knownstructures, materials, or operations are not shown or described indetail to avoid obfuscation.

Speech to text technology has become an increasingly popular way ofproviding input to an information handling device. The technology hasbecome more accurate so some users prefer to provide speech input overtraditional methods, for example, using a touch keyboard, conventionalkeyboard, keypad, mouse, and the like, even in situations where the userhas the ability to provide manual input. Since users are increasinglyusing the technology to provide longer bodies of text, rather than shortmessages, the users may desire correctly punctuated bodies of text. Theissue with current speech to text technology is that inputtingpunctuation, or other characters which are not letters, is typicallydone through speech command. In other words, the user has to speak theword associated with the desired punctuation, for example, “exclamationmark” rather than just typing “!”. Punctuation and symbol insertion bycommand diction is both unnatural and disruptive to normal thoughtprocesses. Additionally, in some cases, the speech to text technologymay not support the input of some characters, for example, emoticons.

One currently employed solution relies on software algorithms thatpredict punctuation based upon silence. In other words, if a user pauseswhile providing the speech input, the technology may recognize thatsilence as a comma or a period. The issue with this is that this methodcan be inaccurate. For example, if a user has started a thought but thenpauses to think further, the technology may interpret this as a locationfor a period, but the sentence may not be finished. In an effort toassist in inaccurate placement of punctuation, some speech to texttechnology uses context in order to assist in eliminating this issue,but again, this tends to be inaccurate. Additionally, the punctuationchosen by the software may not be the desired punctuation of the user.For example, the user may desire an exclamation mark, but the softwaremay instead use a period.

Another solution is to allow a user to provide input using aconventional type input method, for example, keyboard, touch keyboard,numeric pad, mouse, and the like, in conjunction with the speech to textinput. However, while the current method does allow a user to providemanual input, the input can only be provided after the text has beendisplayed. In other words, because the speech to text technology has adelay time between the user speaking and the text appearing, in order toprovide the punctuation using the manual method, the user has to waitfor the text to finish appearing. This causes the user to have to waitand disrupts the train of thought, which can become very frustrating.

These technical issues present problems for users in that providingcorrectly punctuated bodies of text may be difficult using conventionalspeech to text recognition software. Conventional speech to textrecognition software requires that a user provide a verbal punctuationor symbol dictation corresponding to the desired punctuation or symbol.Software algorithms that predict punctuation based on silence and wordcontext are not always accurate. Additionally providing characters otherthan letters, for example, symbols, numbers, emoticons, and the like,may be difficult if not impossible with current speech to texttechnology. The current method for allowing a user to provide atraditional input with regard to the symbols requires the user to waituntil the speech to text technology has completed processing the speech.These methods interrupt a user's thoughts and are cumbersome.

Accordingly, an embodiment provides a method of allowing a user toprovide a manual input comprising at least one character while providinga speech input. One embodiment receives a speech input and starts toconvert the speech input to machine text. When an embodiment receives,from an alternate input source, an input comprising at least onecharacter, a location for the at least one character within the machinetext may be identified. In one embodiment the input comprising at leastone character may be received while the speech input is received.Alternatively or additionally, the input comprising the at least onecharacter may be received while an embodiment is converting the speechto machine text.

When receiving the at least one character, an embodiment mayadditionally receive a time stamp associated with the at least onecharacter. In identifying the location for the insertion of the at leastone character, an embodiment may use the time stamp associated with thecharacter and identify a corresponding time stamp within the speechinput. One embodiment may, in addition to the time stamp, use context todetermine the best location for the at least one character. For example,if the time stamp for receipt of the second input and the time stampassociated with the speech input do not match, an embodiment may useadditional information to determine a best location of the at least onecharacter. An embodiment may then insert the at least one character atthe identified location. In addition to the placement of the character,an embodiment may provide an indication of where the character has beenlocated within the machine text. In other words, an embodiment may allowa user to provide manual input while an embodiment is converting speechinput to text, rather than requiring the user to wait until the systemhas finished processing the speech input.

The illustrated example embodiments will be best understood by referenceto the figures. The following description is intended only by way ofexample, and simply illustrates certain example embodiments.

While various other circuits, circuitry or components may be utilized ininformation handling devices, with regard to smart phone and/or tabletcircuitry 100, an example illustrated in FIG. 1 includes a system on achip design found for example in tablet or other mobile computingplatforms. Software and processor(s) are combined in a single chip 110.Processors comprise internal arithmetic units, registers, cache memory,busses, I/O ports, etc., as is well known in the art. Internal bussesand the like depend on different vendors, but essentially all theperipheral devices (120) may attach to a single chip 110. The circuitry100 combines the processor, memory control, and I/O controller hub allinto a single chip 110. Also, systems 100 of this type do not typicallyuse SATA or PCI or LPC. Common interfaces, for example, include SDIO andI2C.

There are power management chip(s) 130, e.g., a battery management unit,BMU, which manage power as supplied, for example, via a rechargeablebattery 140, which may be recharged by a connection to a power source(not shown). In at least one design, a single chip, such as 110, is usedto supply BIOS like functionality and DRAM memory.

System 100 typically includes one or more of a WWAN transceiver 150 anda WLAN transceiver 160 for connecting to various networks, such astelecommunications networks and wireless Internet devices, e.g., accesspoints. Additionally, devices 120 are commonly included, e.g., an audioinput device such as a microphone, an image sensor such as a camera, andthe like. System 100 often includes a touch screen 170 for data inputand display/rendering. System 100 also typically includes various memorydevices, for example flash memory 180 and SDRAM 190.

FIG. 2 depicts a block diagram of another example of informationhandling device circuits, circuitry or components. The example depictedin FIG. 2 may correspond to computing systems such as the THINKPADseries of personal computers sold by Lenovo (US) Inc. of Morrisville,NC, or other devices. As is apparent from the description herein,embodiments may include other features or only some of the features ofthe example illustrated in FIG. 2.

The example of FIG. 2 includes a so-called chipset 210 (a group ofintegrated circuits, or chips, that work together, chipsets) with anarchitecture that may vary depending on manufacturer (for example,INTEL, AMD, ARM, etc.). INTEL is a registered trademark of IntelCorporation in the United States and other countries. AMD is aregistered trademark of Advanced Micro Devices, Inc. in the UnitedStates and other countries. ARM is an unregistered trademark of ARMHoldings plc in the United States and other countries. The architectureof the chipset 210 includes a core and memory control group 220 and anI/O controller hub 250 that exchanges information (for example, data,signals, commands, etc.) via a direct management interface (DMI) 242 ora link controller 244. In FIG. 2, the DMI 242 is a chip-to-chipinterface (sometimes referred to as being a link between a “northbridge”and a “southbridge”). The core and memory control group 220 include oneor more processors 222 (for example, single or multi-core) and a memorycontroller hub 226 that exchange information via a front side bus (FSB)224; noting that components of the group 220 may be integrated in a chipthat supplants the conventional “northbridge” style architecture. One ormore processors 222 comprise internal arithmetic units, registers, cachememory, busses, I/O ports, etc., as is well known in the art.

In FIG. 2, the memory controller hub 226 interfaces with memory 240 (forexample, to provide support for a type of RAM that may be referred to as“system memory” or “memory”). The memory controller hub 226 furtherincludes a low voltage differential signaling (LVDS) interface 232 for adisplay device 292 (for example, a CRT, a flat panel, touch screen,etc.). A block 238 includes some technologies that may be supported viathe LVDS interface 232 (for example, serial digital video, HDMI/DVI,display port). The memory controller hub 226 also includes a PCI-expressinterface (PCI-E) 234 that may support discrete graphics 236.

In FIG. 2, the I/O hub controller 250 includes a SATA interface 251 (forexample, for HDDs, SDDs, etc., 280), a PCI-E interface 252 (for example,for wireless connections 282), a USB interface 253 (for example, fordevices 284 such as a digitizer, keyboard, mice, cameras, phones,microphones, storage, other connected devices, etc.), a networkinterface 254 (for example, LAN), a GPIO interface 255, a LPC interface270 (for ASICs 271, a TPM 272, a super I/O 273, a firmware hub 274, BIOSsupport 275 as well as various types of memory 276 such as ROM 277,Flash 278, and NVRAM 279), a power management interface 261, a clockgenerator interface 262, an audio interface 263 (for example, forspeakers 294), a TCO interface 264, a system management bus interface265, and SPI Flash 266, which can include BIOS 268 and boot code 290.The I/O hub controller 250 may include gigabit Ethernet support.

The system, upon power on, may be configured to execute boot code 290for the BIOS 268, as stored within the SPI Flash 266, and thereafterprocesses data under the control of one or more operating systems andapplication software (for example, stored in system memory 240). Anoperating system may be stored in any of a variety of locations andaccessed, for example, according to instructions of the BIOS 268. Asdescribed herein, a device may include fewer or more features than shownin the system of FIG. 2.

Information handling device circuitry, as for example outlined in FIG. 1or FIG. 2, may be used in devices such as tablets, smart phones,personal computer devices generally, and/or electronic devices which maybe used to process speech input. For example, the circuitry outlined inFIG. 1 may be implemented in a tablet or smart phone embodiment, whereasthe circuitry outlined in FIG. 2 may be implemented in a personalcomputer embodiment.

Referring now to FIG. 3, at 301, an embodiment may receive from an audiocapture device (e.g., microphone, etc.), speech input. For example, auser may be providing speech input for dictation or as a command to adevice. In order to receive the speech input, the device may be runninga speech to text application. As an example, the user may have selecteda button which indicates to the device that the user is providing speechinput to be dictated by the device. Using the speech to text technologyor software, an embodiment may start converting, using a processor, thespeech input to machine text at 302. The conversion may be completedusing speech to text technology or software as known in the art.

At 303, an embodiment may receive, from an alternate input source (e.g.,conventional keyboard, touch keyboard, number pad, mouse, etc.), aninput comprising at least one character. The at least one character maycomprise a letter, emoticon, number, symbol, punctuation, and the like.If a user is running a speech to text application, the provision of thecharacter may occur while the application is still active. In otherwords, conventional speech to text application and software packagesrequire that a user stop the application in order to provide input usinga different method. Rather than requiring the user to stop theapplication, an embodiment can incorporate the provided manual inputwhile the application or software remains active.

The input may be received while the user is providing the speech input.For example, the user may say “How are you” and then write a “?” at atouch screen input and then continue providing speech input. In otherwords, while providing speech input a user may, rather than providing apunctuation dictation (i.e., saying “comma”) as may be required by thespeech to text technology, write a “,” at a touch screen input area.Alternatively or additionally, the input may be provided while thespeech to text technology is converting the speech input to machinetext. For example, a user may have already spoken “Hi my name is Jane”,as the speech to text technology is converting the text, the user maydecide they want a comma after “Hi”. At the same time the speech isbeing converted, the user may provide a “,” through a traditionalkeyboard.

While receiving the speech input, converting the speech to machine text,and receiving the character input, an embodiment may additionallyreceive or otherwise associate a time stamp with the data. For example,as the user is providing speech input an embodiment may identify a timestamp associated with each of the words or provided speech input. Thetime stamp may be created, identified, and/or stored locally on thedevice. Alternatively, the time stamp may be created, identified, and/orstored remotely, for example, on a cloud storage device.

At 304, an embodiment may identify a location associated with themachine text to insert the at least one character. In one embodiment thelocation may be identified using the time stamp associated with theinputs provided. As an example, referring to FIG. 4A, a user may providea speech input 401 stating “Today is Friday I thought it's onlyThursday”. After saying “Friday” the user may provide a manual input 402of a “?”, and after saying “Thursday” the user may provide a manualinput 402 of “!!!”. The manual input 402 of“?” may be associated with afirst time stamp 403 which corresponds to the first time stamp 403 inthe speech input 401, which is after the spoken word “Friday”. Themanual input 402 of “!!!” may be associated with a second time stamp 404which corresponds to the second time stamp 404 in the speech input 401,which is after the spoken word “Thursday”.

When using the time stamp for location identification, the time stampfor each input or conversion should be obtained from the same locationin order to ensure that the time stamp is consistent between the inputs.For example, if the time stamp associated with the character input isreceived from a local storage location, the time stamp associated withthe speech input should also be received from the same local storagelocation. However, if a local and remote storage location have syncedtime stamps, it is possible to pull the time stamps from multiplelocations.

One embodiment may use additional characteristics of the speech and/ormanual input to identify a location. As an example, one embodiment mayuse context characteristics associated with the speech input to identifya location for insertion of the character. This use of contextcharacteristics may be similar to current methods of determiningpunctuation using speech to text technology based upon context. Forexample, if the user provides a manual input that does not correctlycorrespond to a time stamp within the speech input (e.g., the userprovides the manual input a little before or after the correct locationof insertion), an embodiment may use additional methods to identify thecorrect location of insertion of the character. If additionalcharacteristics cause conflict between each other an embodiment maydefault to just using the time stamps to identify the location ofinsertion.

If the technology has already converted the speech to machine input, auser may still be able to provide input for character insertion. Forexample, if an embodiment has already converted “Are you coming” tomachine text, but the user wants to add a question mark, the user canselect the word “coming” and the provide a “?” input using a manualmethod. Selecting does not necessarily mean that the user selects thewhole word, but may rather include the user moving the cursor to adesired location of insertion. In selecting the word, the user does nothave to select the whole word, nor does the user need to move the cursorto the exact spot of insertion. For example, the user does not have toplace the cursor at the end of “coming” rather the user can just touchthe word “coming” and an embodiment may determine that the most logicalplace of insertion is at the end of the word rather than the middle.

The provision of the input may occur after the user has stoppedproviding speech input, or may occur while the user continues to providespeech input. For example, the user may say “Are you coming” andcontinue to speak while selecting the word “coming” and providing the“?” input. In other words, the user does not have to stop the speech totext application in order to provide the additional input. This can alsobe used to correct an improperly converted speech input. For example, auser may which to end a sentence with “?!?”but the system has onlyprovided a “?”. The user may select the “?” and provide the additionalinput to modify the machine text.

If, at 304, an embodiment cannot identify a location of insertion forthe character, an embodiment may take no action at 306. Alternatively,an embodiment may notify the user that a location cannot be identified,and allow the user to provide the correct location for the insertion ofthe character. This may also occur if an embodiment cannot correctdetermine the character that the user has provided. For example, if anembodiment cannot determine whether the user provided a “;” or a smileyface emoticon, an embodiment may take no action at 306 or may prompt theuser for additional input.

However, if an embodiment can identify a location at 304, an embodimentmay, at 305, insert the at least one character at the locationidentified. For example, referring to FIG. 4B, after identifying thelocation of the manual input as explained in connection with FIG. 4A, anembodiment may insert the characters into the machine text and result inthe output as shown in FIG. 4B. When using a touch input, after anembodiment has inserted the characters into the machine text, thewritten character provided by the user may be removed from the touchsurface and/or touch display.

An embodiment may additionally provide a notification to the user thatthe character has been inserted. For example, an embodiment mayhighlight the character, change the color of the character, circle thecharacter, or otherwise provide an indication of the inserted character.An embodiment may additionally allow a user to confirm the location ofthe character. Upon receiving no user input regarding the location ofthe character, an embodiment may maintain the location of the character.Alternatively, an embodiment may remove the character insertion. Thisaction (i.e., whether the character location is maintained or removed)may be set by default within the system or may be provided by the user.

The various embodiments described herein thus represent a technicalimprovement to conventional speech to text technologies. Using thetechniques described herein, a user can provide manual input at the sametime as speech input or while the system is converting from the speechinput to machine text. Thus, using the methods and systems describedherein, a user can provide a more natural and less disruptive way ofaugmenting speech input with symbols, emoticons, punctuation, numbers,letters, and other characters while using speech to text technology.

As will be appreciated by one skilled in the art, various aspects may beembodied as a system, method or device program product. Accordingly,aspects may take the form of an entirely hardware embodiment or anembodiment including software that may all generally be referred toherein as a “circuit,” “module” or “system.” Furthermore, aspects maytake the form of a device program product embodied in one or more devicereadable medium(s) having device readable program code embodiedtherewith.

It should be noted that the various functions described herein may beimplemented using instructions stored on a device readable storagemedium such as a non-signal storage device that are executed by aprocessor. A storage device may be, for example, an electronic,magnetic, optical, electromagnetic, infrared, or semiconductor system,apparatus, or device, or any suitable combination of the foregoing. Morespecific examples of a storage medium would include the following: aportable computer diskette, a hard disk, a random access memory (RAM), aread-only memory (ROM), an erasable programmable read-only memory (EPROMor Flash memory), an optical fiber, a portable compact disc read-onlymemory (CD-ROM), an optical storage device, a magnetic storage device,or any suitable combination of the foregoing. In the context of thisdocument, a storage device is not a signal and “non-transitory” includesall media except signal media.

Program code embodied on a storage medium may be transmitted using anyappropriate medium, including but not limited to wireless, wireline,optical fiber cable, RF, et cetera, or any suitable combination of theforegoing.

Program code for carrying out operations may be written in anycombination of one or more programming languages. The program code mayexecute entirely on a single device, partly on a single device, as astand-alone software package, partly on single device and partly onanother device, or entirely on the other device. In some cases, thedevices may be connected through any type of connection or network,including a local area network (LAN) or a wide area network (WAN), orthe connection may be made through other devices (for example, throughthe Internet using an Internet Service Provider), through wirelessconnections, e.g., near-field communication, or through a hard wireconnection, such as over a USB connection.

Example embodiments are described herein with reference to the figures,which illustrate example methods, devices and program products accordingto various example embodiments. It will be understood that the actionsand functionality may be implemented at least in part by programinstructions. These program instructions may be provided to a processorof a device, a special purpose information handling device, or otherprogrammable data processing device to produce a machine, such that theinstructions, which execute via a processor of the device implement thefunctions/acts specified.

It is worth noting that while specific blocks are used in the figures,and a particular ordering of blocks has been illustrated, these arenon-limiting examples. In certain contexts, two or more blocks may becombined, a block may be split into two or more blocks, or certainblocks may be re-ordered or re-organized as appropriate, as the explicitillustrated examples are used only for descriptive purposes and are notto be construed as limiting.

As used herein, the singular “a” and “an” may be construed as includingthe plural “one or more” unless clearly indicated otherwise.

This disclosure has been presented for purposes of illustration anddescription but is not intended to be exhaustive or limiting. Manymodifications and variations will be apparent to those of ordinary skillin the art. The example embodiments were chosen and described in orderto explain principles and practical application, and to enable others ofordinary skill in the art to understand the disclosure for variousembodiments with various modifications as are suited to the particularuse contemplated.

Thus, although illustrative example embodiments have been describedherein with reference to the accompanying figures, it is to beunderstood that this description is not limiting and that various otherchanges and modifications may be affected therein by one skilled in theart without departing from the scope or spirit of the disclosure.

What is claimed is:
 1. A method, comprising: receiving, from an audiocapture device, speech input; converting, using a processor, the speechinput to machine text; receiving, from an alternate input source, aninput comprising at least one character; identifying, using a processor,a location associated with the machine text to insert the at least onecharacter; and inserting, using a processor, the at least one characterat the location identified.
 2. The method of claim 1, wherein thereceiving the input occurs while converting the speech input to machinetext.
 3. The method of claim 1, wherein the receiving the input occurswhile receiving the speech input.
 4. The method of claim 1, wherein thereceiving an input additionally comprises receiving a time stampassociated with the input and wherein the identifying a locationcomprises identifying a corresponding time stamp within the speechinput.
 5. The method of claim 4, wherein the identifying furthercomprises identifying a location based upon context characteristics ofthe speech input.
 6. The method of claim 4, wherein the time stampassociated with the input and the corresponding time stamp within thespeech input are received from a single source.
 7. The method of claim1, wherein the speech input is received using speech recognitionsoftware and wherein the speech recognition software remains activewhile the input comprising at least one character is received.
 8. Themethod of claim 1, wherein the receiving comprises receiving input froma touch input source.
 9. The method of claim 1, further comprisingdisplaying the at least one character at the location identified and anotification indicating the at least one character has been inserted atthe location.
 10. The method of claim 1, wherein the secondary input isselected from the group consisting of: punctuation, emoticons, numbers,letters, and symbols.
 11. An information handling device, comprising: anaudio capture device; a processor operatively coupled to the audiocapture device; a memory device that stores instructions executable bythe processor to: receive, from an audio capture device, speech input;convert, using a processor, the speech input to machine text; receive,from an alternate input source, an input comprising at least onecharacter; identify a location associated with the machine text toinsert the at least one character; and insert the at least one characterat the location identified.
 12. The information handling device of claim11, wherein to receive the input occurs while converting the speechinput to machine text.
 13. The information handling device of claim 11,wherein to receive the input occurs while receiving the speech input.14. The information handling device of claim 11, wherein to receive aninput additionally comprises receiving a time stamp associated with theinput and wherein to identify a location comprises identifying acorresponding time stamp within the speech input.
 15. The informationhandling device of claim 14, wherein to identify further comprisesidentifying a location based upon context characteristics of the speechinput.
 16. The information handling device of claim 14, wherein the timestamp associated with the input and the corresponding time stamp withinthe speech input are received from a single source.
 17. The informationhandling device of claim 11, wherein the speech input is received usingspeech recognition software and wherein the speech recognition softwareremains active while the input comprising at least one character isreceived.
 18. The information handling device of claim 11, wherein toreceive comprises receiving input from a touch input source.
 19. Theinformation handling device of claim 11, wherein the instructions arefurther executable by the processor to display the at least onecharacter at the location identified and a notification indicating theat least one character has been inserted at the location.
 20. A product,comprising: a storage device that stores code executable by a processor,the code comprising: code that receives, from an audio capture device,speech input; code that converts the speech input to machine text; codethat receives, from an alternate input source, an input comprising atleast one character; code that identifies, using a processor, a locationassociated with the machine text to insert the at least one character;and code that inserts, using a processor, the at least one character atthe location identified.